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Abstract 

A new result on stability of an optimal nonlinear filter for a Markov chain 
with respect to small perturbations on every step is established. An exponen¬ 
tial recurrence of the signal is assumed. 


1 Introduction 

Stability of optimal filters is a topical research area in the last three or even more 
decades. In this direction, a lot has been understood and achieved under the “uniform 
ergodicity” assumptions due to the method by Atar and Zeitouni (see ED based 
on the Birkhoff metric (also known as projective or Hilbert metric). This method 
under such assumptions guarantees an exponential rate, with which the optimal 
filter algorithms “forgets” wrong - or unspecified - initial conditions. The method 
has been extended to the “non-uniform ergodic” case (see 0) by combining the 
application of Birkhoff metric with a modified version of the coupling method, which 
leaded to exponential or polynomial rates, with which the algorithm may “forget” 
wrong initial data. However, unspecified initial distributions is not the only option 
for an unspecified model. Small errors on each step of the algorithm (in discrete 
time case) is one more possibility to perturbe a model. In the “uniform ergodic” 
case it was also tackled in the literature (see 0)- The “non-uniform ergodic” case 
is still waiting for its investigation and our goal here is to attack this problem. In 
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our setting only “uniformly small” errors are allowed and conditions on the densities 
of the noise both in the signal and in the observations look rather strict, so that 
new studies will be required to weaken conditions so as to include a wider class of 
processes. 

The setting described earlier is not only insteresting as such: it may also serve as 
a base for studying unspecified models with an unknown parameter. In such models, 
observations should allow to estimate the parameter. Once the estimator is, at least, 
consistent, there is a hope that the filtering algorithm for a model with an estimate 
instead of the “true parameter” may be close enough to the exact model. Hence, 
the previous studies could be applied. This programme - again in the “uniform” 
case - was realised in M, where it was assumed that the estimator satisfies certain 
large deviation conditions. However, in many examples “non-uniform” conditions 
are more natural. Hence, a large part of the problem remains open and requires 
further investigations. We restrict ourselves to the case of exponential recurrence and 
exponential moments for the signal and postpone other cases till further research. 

The paper consists of three sections: Introduction, Setting and main result, which 
also includes some auxiliary results; Proof of auxiliary lemmata; Proof of main result 
- the Theorem [[] 

2 Setting and Main Result 

We consider the following model, with a non-observed (Markov) state process { X n , n > 
0} and an observation process {Y n ,n > 0}, taking value in M. d and respectively. 
We assume that the state sequence {X n , n > 0} is defined as a homogeneous Markov 
chain with transition probability kernel Q(x,dx'), i.e.: 

F[X n G dx 1 1X 0:n _i] \x n -i=x = P[ X n G dx' |X n _ 1 ]|j Cn _ 1 =* = Q(x, dx'), (1) 

for all n > 1, and with initial distribution /i 0 . 

We also assume that given the state sequence {X n ,n > 0}, the observations 
{Y n ,n > 0} are independent, the conditional distribution of Y n depends only on X n 
and that the conditional probability distribution F[Y n G dy\X n = x] is absolutely 
continuous with respect to the Lebesgue measure, i.e.: 

P [Y n G dy\X n = x] = T(x, y) dy , (2) 

for some Borel measurable with respect to the couple ( x,y) function T. The basic 
example which is to be covered will be the following: 
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We consider a discrete time filter for a Hidden Markov chain (X n ) with values in 
the Euclidean space M. d , with conditionally Markov observations (Y n ) also from M 1 
satisfying the system 


X n +i — X n + b(X n ) + £ n +i, (n > 0), (3) 

Y n = h(X n ) + V n (n > 1), (4) 

where (£ n , V n ) is a sequence of IID random vectors of dimension d + t with densities 
q^(x), qv(y), b(-) is a d- dimensional vector-function, h(-) an ^-dimensional vector- 
function, that is, 

Q(x, dx') = q^((x' — x — b(x))) dx', (5) 

V(x,y) = q v (y-h(x)) (6) 

(remind that q g and q v denote the densities of £i and V\ respectively). 

The problem addressed in this paper is as follows. Assume that the exact pa¬ 
rameters of the model (fTj) - (l2]l — i.e., the initial distribution /io, the transition kernel 
Q(x,dx' ) and the conditional density of the observations T(x, y) — are known with 
some errors or that we know only an approximations to the exact characteristics 
of this model. Hence, the statistician is unable to use the exact optimal filtering 
algorithm for estimation of X n at each time n, and he is left to apply a filtering 
algorithm with wrong parameters and with additional errors in the algorithm itself. 

Under such conditions, the goal is to investigate the asymptotic behaviour of this 
error in the available algorithm in the long run. It follows from the earlier results on 
the subject - see [1] - that it is sufficient to work with errors in the kernels assuming 
that initial distribution po is known exactly. (If not, it may be tackled by using the 
methods and results from [4].) More precisely the setting will be explained in the 
section 12.11 below. 

Throughout the paper, we denote the wrong transition kernel and conditional 
density of the observations by P(x, dx') and by S(x, y) respectively. Assumptions 
will be stated in the form of Q , P and T, 5 and examples will be provided in terms 
of the coefficients and properties of the original system ([3])-(j4|). 

2.1 Main result 

To explain the main problem addressed in this paper in detail, we should formulate 
exact and wrong filtering algorithms. Let us remaind to readers the exact filtering 
algorithm. The problem of nonlinear Eltering is to compute at each time n, the 
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conditional probability distribution y n of the state X n given the observation sequence 
Y 1:n = {Y u Y 2 , ... Y n , }, i.e.: 


» n (A) = fZ(A) = P[X n e A\Yi,... Y n ). 

Using Bayes’ formula, the exact posterior filtering conditional measure can be 
represented as a probability measure for any Y via the following random non-linear 
operator S applied to the initial measure y 0 , 

fln^dXn') fi n (dXnJ P^q £ dx n | 1 i, . . . Y n ) 


\ Qfa- 1 , dxi)d^°^(x i: Yi)fi 0 (dx 0 ) 


i =1 


c£° 


~[Q(xi- 1 ,dx i )'&(xi,YJno(dx 0 ) =: n 0 S^°{dx n ). 


i=l 


(7) 


Here 4is a conditional density of Y t at y*, given X, = and Q(x,dx') 
is a transition kernel for the Markov chain X n , n > 0. The random normalization 
constant c^° is defined as follows, 


c r° = 


i=i 


yi=Yi,—yi=Yi 


and, correspondingly, 


<C = 


"i-1 


cf° 


^.(nH'UVvfe) 




yi=Yi,... yi =Yi 


Hence, 

C n° = C n M ° = / TT Q(Xi_i, Ui)^o(^o)- 

Vltr + 1 

Now, the “wrong filtering algorithm ” can be formulated more precisely as follows. 
Recall that it is assumed that we do not know the transition kernel Q(x, dx') and the 
conditional density of the observations T (x, y ) exactly, but only some approximations 
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P(x, dx') and S(x, y) respectively. Hence we can define another sequence of measures 
(n' n {A )) n >i as follows: 


n 11 

fM n {dx n ) = / 

jRn t= i 

1 r n 

= vyy / JJP(xi_i,dx i )S(x ij y i )// 0 (da:o) =: A*o^ ,w, (d®n), 

C 'l jRn i = 1 

where the “wrong” normalizing constant cjj° can be defined as follows: 

« n 

/ TT P(xj_i, da^Efo, F;)/i 0 (Gfeo)- 


SMO _ 
°n 


The problem is to estimate the asymptotic behaviour of the difference: 


( 8 ) 

( 9 ) 


-^>0 11 dn 11 tv ' 

We may not hope that this discrepancy goes to zero as n —* oo, but just that under 
certain conditions it may remain small for all values of n. 


Assumptions 

(Al) — bounded (small) local perturbations - 
We assume that 


ln( sup 

x,x',z,y 


Q(x, dx')^(z,y) 

pi^kPW^) ) + H , s Z 


P(x } dx')E(z,y) 
Q(x, dx')'£(z,y) 


q < oo. 


(A2) — local mixing - 

We assume that for any R > 0 


C R =:sup 

Dr 


f Q(% 0 , dx') P(x 0 ,dx') \ 
v Q(v 0 , dx') ’ P(v 0 , dx ') ) °° 


with D r \= {{x Q ,v 0 ,x') : |x 0 |, |n 0 |, W\ < R, } 
(A3) — positiveness of conditional densities - 


T(x, 2 /)> 0 , E(x,y)> 0, Vx,y. 


( 10 ) 
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(A4) — condition of exponential recurrence in terms of the transition kernels: 
there exist q G (0,1), R , K, c > 0 such that for \x\ > R , 


exp(c\x'\)Q(x, dx') ) V ( / exp(c|a/|)Q(:r, dx') I < pexp(c|x|), (11) 


and for |x| > R, 


exp(c|x , |)(5(x, dx') ] V ( / exp(c|x / |)(5(x, dx') ] < K. 


( 12 ) 


This condition may be also re-written as follows: 

|a(x)| := |E^Xi| < q\x\ + K, & \a\x) := Ef^i| < q\x\ + K. 


(A5) — uniformly small influence of observations: there exists S > 0 such that 

(x,y) 


^(x,y) 

SU P ~n —,—r 

x,y,x' ^ i V) 


V sup 

x,y,x' 


(x',y) 


<1 + 5, 


and for p from (A4) 


(1 + 5)p < 1 . 


Although the assumption (A4) requires explicitly exponential tails of the noise 
in the signal, we give examples with both exponential and polynomial tails 
showing in what situations the assumption (A5) may be verified. Polynomial 
examples may be useful in the future. 


Sufficient conditions for the assumption (A4) in terms of the equation (J3]) and 
its approximation 


K+i = K + UK) + (Uv (n>0), 


y; = h(x' n ) + v; (n> 1 ), 


(13) 

(14) 


may be offered as follows. 
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(A4’) — another condition of exponential recurrence: there exist r > 0 such that for 
|x | large enough, 

\x + b{x )| V \x + b(x )| < |x| — r, 

and for any R > 0 


sup \x + b{x) | V sup \x + b{x) \ < oo, 

|x|>_R |x|>_R 


and 

E£i = E£( = 0, 

and hnally, there exists e > 0 such that 


Ecxp(e|£i|) +Eexp(e|^|) < oo. 


Remark 1. Apparently, (A^a) implies the following (notations are taken from the 
calculus in the proof - see below): there exists a constant C < oo such that for any 
probability measure n, 


sup 

y 


e^ x '^pQ y,IJ '(x, dx')n(dx) < C 


e £ ^n(dx). 


Equivalently, we could say that for any probability measure /i we have. 


sup E^ (e e|Xl| |Fi) <C e £lxl ia(dx). 


(15) 


The same holds true for the approximation kernel P. 


The following is the main result of the paper. 

Theorem 1. Let J e e|x| p 0 (dx) < oo. Then, under the assumption (Al) - (A5) 
above, there exists a constant C > 0 such that the following bound holds true: 

supE Mo ||/i(j — fi n \\ TV < Cq. (16) 
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Remark 2. Assumption (Al) is valid for example for the model (!3}) - (l4!) with q v = 
= C exp(—|x|) and with b : b(x) = b(x) if |x| > K for some K. 

Note that the value of q may be an arbitrary value greater than zero; however, the 
question is meaningful if this constant is small. It would be also nice to localize this 
condition, see the Remark^ in the sequel; we leave it till further studies. 

Remark 3. Let us show how the assumption (A5) may be checked. 


Example 1. Recall thaVit(x,y) — q v (y — h{x)). Let q v (y) = c(l + \y\) m , \y\>M. 
Assume that sup x \h(x) \ < 5' and this S' is small. Then for \y\ large enough, 


^(x,y) Qv(y ~ h(x)) 
q v (y — h(x')) 


(1 + \y — h(x)\) m (1 + \y — h(x')\) m 
(1 + \y - h(x')\)- m (1 + \y ~ h(x)\) m 


f l + \y - h(x')\ \ m f l + \y\ + 

\l + \y~h(x)\J “Vi + M-^V 


( l + S'/(l + \y\) 

VI -S'/{1 + \y\) 


« 1 + 2771(5'/(1 + \y\) < 1 + 2777.(5'. 


While for \y\ small enough, say, \y\ < M', 


V(x,y) = g v {y-h(x )) sup | y „_ y |< 6 , g v (y") 
ty{x',y) q v (y-h(x'))~ inf| y >- y \<s'q v (y') 


If qv > 0 everywhere and is continuous, then the latter right hand side is close to 1 
uniformly in \y\ < M'. 


Example 2. Let q v (y) ~ c( 1 + \y\) m , \y\ oo. Assume that sup x \h(x)\ < S' and 












this S' is small. Then for \y\ —$■ oo 


V(x,y) = Qv(y~ h ( x )) 

qv(y-h(x')) 

(1 + \y — h(x)\)~ m (1 + \y — h(x')\) m 

{1 + \y - h(x')\)~ m ~ (1 + \y-h(x)\) m 


n + \ y -h(x')\ y f i + \y\ + 

\l + \y-h(x)\J “Vi + M-^V 


( l + S'/(l + \y\) 

Vi-5V(i + |y|) 


ps 1 + 2 mS 1 /(1 + \y\) < 1 + 2 mS'. 


For y bounded, the ratio remains close to one if S' is small enough. 


Example 3. Let g v (y) = cexp(— \y\) for \y\ > M, and g v be continuous. Assume 
that h is bounded and small: \h(y)\ < S'. Then, for any value of y, 

^{ x ,y) = qv{y - K x )) 

x ',y ) qv{y-h(x')) 


cexp(-\y - h(x )|) 
cexp(—|y - h(x') |) 


expHy - h(x )| + | y- h(x')\) 


< exp (2S') ps 1 + 2 S'. 


Example f. Let g v (y) ~ cexp(—1?/|) as \y\ —$■ oo (equavalent, i.e., the ratio 
converges to one), and let g v be continuous. Then, for \y\ large enough we have, 

v(x,y) = qv(y - K x )) 

^{x',y) q v (y — h(x')) 

_ q v (y ~ h(x))/cexp(-\y - h(x) |) cexp(-|y - h(x )|) 
q v (y - h(x'))/cexp(—\y - h(x') |) cexp(-|y - h(x ')|) 

~ exp(— \y — h(x)\ + \y — h(x') |) < exp(2<5') ps 1 + 2 S'. 

And for \y\ bounded the ratio remains close to one due to continuity. 

NB. For Gaussian densities the assumption (A5) apparently fails. This could be 
possibly overcome if we manage to do some ’’localisation”. 
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Remark 4. It would be interesting to replace the Assumption (Al) by a local as¬ 
sumption of the type, 


Q(x,dx')4/(z,y) P(x, dx')E(z,y ) 

11 x,x'!t P yeK P(x, dx')E(z,y) n *, s', zjye* Q(x, dx')'&(z,y) K °° 


with qx small, perhaps, in addition to 

Q(x,dx')4>(z,y) P(x, dx')E(z,y ) 

(with q arbitrary and finite) and to change the current statement of the Theorem [7] 
to the following one: the bound holds true, 


Ejhi'n ~ Pn\\ TV < Cq K + In sup E x exp (of), (17) 

xeK 

or, possibly, 

£JK - TnWrv < C( lK + gill sup E x exp (of), (18) 

x£K 

or likewise. At the moment it is a conjecture that one of the bounds 11611181) may 
hold true under less rigorous conditions than those in the Theorem 0 


2.2 Auxiliary results 

In |4]J it was proved, in particular, that under the “exponential” assumptions equiv¬ 
alent to (A4) the following estimate holds true: 

- V Q S $’ V °||tv < C'exp(-CV). (19) 

Here, we need some minor modihcation of (TT6l) . Recall that the proof of this 
estimate was based on the inequalities (14) and (20) from |4]. In turn, (14)/[4] 
followed from (11) and (12)/[4j, while (20)/14] was a corollary from the results about 
mixing for the recurrent and ergodic signal process. What is important for the 
present paper, is that the basic inequality (12) ||4] admits an improved version under 
the condition that the initial Birkhoff distance (13) between measures /io and uq is 
finite: 

p(lMhVo) < oo. 

In [4] this was not assumed and there was no reason for using such an improved 
version; on the contrary, the absence of this assumption allowed to cover a wider 
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class of processes. However, now this will be important and the new version we need 
is as follows: 

P^Pni ^n) Ui Pi.i.P'ni ^n) i {ym Pn)') — CtT ^ p(/iO; Z/q), (20) 

with some C > 0 and ttr < 1. We do not explain here what are exactly k,p n ,v n , 
et al. because it would require to copy several pages from [4j, but use the notations 
from [4] verbatim. The point is that as a result of these improvements, we now 
formulate a version of Theorem 1 from [4] as follows. 


Theorem 2 [version of [4j] 

Let < «. Then under the assumes (A2> - (M), the follounnt, 

bounds hold true: there exist constants C, C(/i), a, e > 0 such that 


with 


E ro\\PoS^° - \\ TV < C(/i 0 ) exp(—cm)p(/i 0 , z'o), 

C(p) < / exp(e|x|) p(dx). 


( 21 ) 


Note that we do not assume (Al) here because the statement relates only to identical 
kernels: P = Q, and also S = T, in which case (Al) holds automatically with q = 0. 

Also note that both versions - the Theorem 2 above and the Theorem [Tj from [4] - 
could be combined with the help of the value 1 A p(po, uq) in the right hand side. 


Lemma 1. Under the assumption ( A4’), the assumption ( A4) holds true. 

Lemma 2. Assume that (A4) holds true. Let f = ?r = inf(f > 0 : \X t \ < R). Then 
there exist C, c, K, A > 0 such that 

E x exp(Ar) < Cexp(c|x|), sup E x e c ^ < (.K + e c ^), 

t>o 

( 22 ) 

E x exp(Af') < C exp(c|x|), supL^e^ 1 < {K + e c|x| ). 

t> o 

(Recall that the process X' is built via the kernel P and t' via (X')). 
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Lemma 3. Assume (Af) and (A5). Then for the “conditional kernels” 


Q y ' fl (x,dx') := Q(x,dx') 


^(x',y) 


and 


'P’^&dx') ■= P(x,dx') 


ff Q(x,dx 1 )^(x 1 ,y)p,(dx) ’ 
E(x',y) 


ff P(x,dx 1 )E(x 1 ,y)p,(dx) ’ 
the following bounds holds true: for |x| large enough, 

and with any R > 0 for |x| < R, 


(23) 


e *'lQV,i‘( Xt d x ’)l v | / z c Wpy-“(x,dx')\ < K', 


(24) 


with p' = ^(1 + <5) and K' = K (1 + S). Moreover, since p' < 1 (see the Assumption 
(A5)), the following inequality holds true, 


sup E x e c|x *’ y < AT' + e'H 
t> 0 


(25) 


3 Proofs of auxiliary Lemmata 

Proofs of Lemma |J] and Lemma [2] based on [2] and j3j with some changes will be 
added in the next version of this preprint. 

Proof of Lemma\f^ We have due to (A5) and (A4), if |x| is large enough, then 

E x e clx 'i Y] = j e c h'lp y,M ( X; dx') 

<(l + 5)/^V(,.^)<(l + ^. 

If |x| remains bounded, say, |x| < R with some R > 0, then 


12 





Now we estimate, 


Further, 


£ e cl*» I < ldX^I > R)p'e c + ldx'^l < A)AT' 

= p , e c|x "- 1 ' - l(KAl < fl)e c|x »-‘' + lflX&l ^ R ) K ' 
< + K'. 

E Y , yY e c|x "’ V| < E x , tY (p'e c]x '^ + K') 

X n -2 X n-2 \ J 


< K' + p' (p'e c \ x ^ T A") , 


and eventually by induction, 


E x e c ^ Y ! < + (//) n e c M < - 2 ^- + e ' 

1 — p 1 — p 


c\x\ 


The Lemma [3] is proved. 


4 Proof of Theorem Q] 


1. We will use the Birkhoff metric for positive measures, see [5], and also m. i 
(where it is called Hilbert metric; one more synonym is the projective metric), 


p(/i, v) 


(infs: p<sv) 
(sup t : p > tv) ’ 
Too, 


if hnite, 
otherwise. 


Another equivalent definition reads, 


p(/i, v) 


In sup (dp/dv) T In sup (dv/dp), if hnite, 
Too, otherwise. 


(26) 


For any measure p we can define the following nonlinear operator Sfc :n : for k < n 


pS k:n (A ) = pS Y k 


A It* 

. n (A) = c k:n l(x n eA) Q(x j -i,dx j )'&(x j: y j )p,(dx k ), 
J ^ n ~ k j= k +i 
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with a normalizing constant c k:n : 


Ck:n — c k . n — 


Q(x j -i,dx j )^(x j ,y j )ii(dx k ) 


j=k +1 


-1 


k < n, 


and for k = n we let nS k -.n{A) = n(A). Denote 

Lr 1 ^' e A)^(x,y)Q(x,dx t )fi(dx) 


k^{A) : = 




and similarly fiP y is defined, 


Now we have: 


= / RxR l(a:' e A)E(x,y)P(x,dx')fx(dx ) 
11 ’ f RxR Z(x,y)P(x,dx')/t(dx) 

n 

f^n ~ k'n — ^ ^ {^k^k:n ~~ l^k-l^k-lin) ■ 
k =1 


(27) 


It was noted in [6] that the following important property holds true: 

k'k—l^k—l-.n = (hfc—1 Q)Sk:ni 


Yk ■ 


where the operator Q = Q is defined as 


rph,, /\ J Q(x,dx')^(x',Y k )n(dx) 
fiQ (dx ) = 


or, 


Q >k (dx') = Q(x,dx') 


ff Q(x,dx 1 )'P(x 1 ,Y k )fj,(dx) ’ 
*(x',Y k ) 


ff Q(x,dx 1 )^(x 1 ,Y k )fi(dx) ’ 


For the reader’s convenience we recall the reasoning. Indeed, 

k^k:n Tj k'^k-.m 

C k:n 

where the linear non-normalized operator S k:n is dehned as follows: 

« n 

liS k:n (A) = nSX. n (A) = / l(x n e A) ^(x j ,Y j )Q(xj- 1 ,dx j ) 

/fCPn—fc 

j=k +1 


(28) 
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Therefore, we have, 


P n 

nS k:n (A) = / l(x n eA) TT ^(x j ,Y j )Q{x j -i,dx 


j=k +2 


Q(x k , dx k+1 )^(x k+ 1 , Y k+ 1 ) jd{dx k ) = i/S' fc+ i :ri (A), (29) 


with the non-normalized measure is(dx k+ 1 ) defined by the formula 
v(dx k+ 1 ) = / (3(x fe ,dx fe+ i)^(a: fe+ i,r fe+ i)^(dx fe ) 


Hence, 


_ l^Sk-.r 

H'^kin Jl. 


V Sk-\- 1:ti ^*S/c+l:n ^fc+l:n 


^k:n 

Also note (follows from the calculus with A = 


°fc+l:n °/c:n 

) that 


(30) 


r M = r u 
L 'k:n L 'k+l:rv 


because 




(n — k — l) 


l{x n eR d ) ^(x j ,y j )Q(x j _ 1 ,dx j )- 


j=k +2 


C(x fc ,dx fc+ i)T(x A;+ i,|/ fc+ i)/i(da: fc ) = i/S' fc+ i :n (R d ). (31) 


The equation (1301) implies that 

^*5fc+l:ri (^/^(®0)*5fc+l:n ^^fc+lir 


H'Skm 


“'fc+l:n 


Cfe+Un/ 1 ' 


r u 

L 'k-\-l:n 


with 


'/i ,\ ~Y k( , A ^eh' TT^fc/, A f Q(x,dx)V(x ,Y k )fi(dx) 

'(dx ) = v k (dx) = = vQ (dx)= - w , 

JJ Q(x,dx 1 )W(x 1 ,Y k )/j,(dx) 


So, indeed, the announced important property (1281) holds true. 
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Further, since p! n S n:n = p! n and poSo , n = p n and because p' 0 = po and p' 0 S o :n = 
PoS 0 -.n = p n , we obtain, 


A^n /^ri — P Sk-.n (l^k-lQ) ^k:n) i 


k =1 


where fJ.' k _\Q = p' k -\Q *\ /4-i-P = /4-i-P *• So, it follows that 


11/4 - /^i||tV < ^ ll/^fc-l-P - (/^fc_iQ)-S'fc : n||TV, 

fc=l 


and 


-^Moll/4 _ /FiHtV < ^-^o 11/4-1 P K S k:n ~ {tfk-lQ ^ ) 'S'fcin || TV- (33) 


TThvr? 


fc=l 


(32) 


2. By virtue of the Theorem 2 under our assumptions we have, 

£ A V'll/ t ‘ S 'o:n - Z'S'o^IItV < C([M, Z/)e~" n p(/l, Z/), 
where a does not depend on the initial measures, while C(p, v) admits a bound 


C(p, v) < I (e c ^n(dx) + e c]x '^(dx')) 


c\x'\ 


with some c > 0. 


Also recall that due to the Lemma [2] (here the process X' corresponds to the 
kernel P), 


sup E^e 6 ^ < K + / e 6 ^' 1 po(dx), 

t> o J 


sup E vo e € W <K+ e elx ' l v 0 (dx'). 

t> o J 


3. Further, all of the above imply that 


J l^k j 




■"^fe+l v»^n 0 Y k+ 1 ,...,Y r 


O 

VkO. 


k:n II TV 


Y u ...,Y k 


<C(fx k ,u k )e- a ^p(^,u k ) 


< Ce~ a ^p(p k , u k ) / e^l (tf^idx) + i%’' Y \dx)), 


(34) 


(35) 
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where the constants C and a are non-random and do not depend on k. Denote 

D M :=fe^dx) + v{ dx)). 

By virtue of the inequalities (l33|i and (l35|) . we have, 

EnoWrfn ~ k’nWrv 


<Y, CE ^ D (^^-iQ 


' ,/ fy* k ^p— a i n ~ fc) 






k =1 


< Cq Y1 E *> D Wk,t J 'k-\Q 


i,'. r> Yk \ e - a ( n - k )' 


k =1 




But n' k = /U / fc _ 1 P (the same operator but with the wrong kernel), so we obtain 

p(/4X-i<2 Vfc ) = pip'k-J^^iA-\Q Yk ) < Q- 

Thus, 

n 

E „IK - fcll tv < Y. Cqe-° {n - k) E n D(nl /4-iQD (36) 

k =1 

4. It remains to estimate the term so as to show that it does not exceed some finite 
constant uniformly in k. We have, 


E k D(A, #4-i Q *) = E„ J e^^(dx) + E n j e^A-iQ ‘(*0- 

Here the hrst term in the right hand side satisfies the bound 

(sup) E IJo [ e c|a V fe (ote) < C [ e°Wno{dx) < oo, 


according to the first part of 
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Let us inspect the second part of the right hand side. We have, 


E K f e‘ W (4-i Q rt (dx) = E„j 

< 5) (1 + i)E m J e c| "'(/4-i Q)(dx k ) 

< K( 1 + S) + (1 + S)E m h+'fjLil*) 

< K' + AT + (1 + 5) / e c|a;| /io((ix) < cx), 

according to (1251) and the assumption (A5): see the Lemma El The Theorem [1] is 
proved. 


Acknowledgements 

For the second author, the article was prepared within the framework of a subsidy 
granted to the HSE by the Government of the Russian Federation for the implemen¬ 
tation of the Global Competitiveness Program. The same author is also grateful for 
support of the RFBR grant 13-01-12447-o£-m2. 


References 

[1] Atar, R., Zeitouni, O. Exponential stability for nonlinear filtering. Ann. Inst. H. 
Poincare Probab. Statist. 33(6), 1997, 697-725. 

[2] Gulinskii, O. V. , Veretennikov, A. Yu. Rate of mixing and the averaging principle 
for stochastic recursive procedures. Aut. Remote Control. 51(6), 1990, 779-788. 

[3] Gulinsky, O. V., Veretennikov, A. Yu. Large deviations for discrete - time pro¬ 
cesses with averaging. VSP, Utrecht, The Netherlands, 1993. 

[4] Kleptsyna, M. L., Veretennikov, A. Yu. On discrete time ergodic Liters with 
wrong initial data. Probability Theory and Related Fields, 141, 2008, 411-444. 

[5] Krasnosel’skii, M. A., Lifshits, E. A., Sobolev, A. V. Positive linear systems. The 
method of positive operators. Heldermann, Berlin, 1989. 


18 


[6] Le Gland, F., Oudjane, N. Stability and uniform approximation of nonlinear 
filters using the Hilbert metric and application to particle filters, The Annalas of 
Applied Probability, 14(1), 2004, 144-187. 

[7] Papavasiliou, A., A uniformly convergent adaptive particle filter, J. Appl. Prob. 
42, 2005, 1053-1068. 

[8] Papavasiliou, A., Parameter estimation and asymptotic stability in stochastic 
filtering, Stochastic Processes and their Applications, 116, 2006, 1048-1065. 


19 


